Dataset info
| Number of variables | 22 |
|---|---|
| Number of observations | 84548 |
| Missing cells | 0 (0.0%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 14.2 MiB |
| Average record size in memory | 176.0 B |
Variables types
| Numeric | 9 |
|---|---|
| Categorical | 12 |
| Boolean | 0 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 0 |
| Rejected | 1 |
| Unsupported | 0 |
Warnings
ADDRESS has a high cardinality: 67563 distinct values | Warning |
APARTMENT_NUMBER has a high cardinality: 3989 distinct values | Warning |
BUILDING_CLASS_AT_PRESENT has a high cardinality: 167 distinct values | Warning |
BUILDING_CLASS_AT_TIME_OF_SALE has a high cardinality: 166 distinct values | Warning |
COMMERCIAL_UNITS is highly skewed (γ1 = 214.4011234) | Skewed |
COMMERCIAL_UNITS has 79429 (93.9%) zeros | Zeros |
EASE-MENT has constant value " " | Rejected |
GROSS_SQUARE_FEET has a high cardinality: 5691 distinct values | Warning |
LAND_SQUARE_FEET has a high cardinality: 6062 distinct values | Warning |
NEIGHBORHOOD has a high cardinality: 254 distinct values | Warning |
RESIDENTIAL_UNITS is highly skewed (γ1 = 60.70273283) | Skewed |
RESIDENTIAL_UNITS has 24783 (29.3%) zeros | Zeros |
SALE_DATE only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
SALE_DATE has a high cardinality: 364 distinct values | Warning |
SALE_PRICE has a high cardinality: 10008 distinct values | Warning |
TOTAL_UNITS is highly skewed (γ1 = 63.44833684) | Skewed |
TOTAL_UNITS has 19762 (23.4%) zeros | Zeros |
YEAR_BUILT has 6970 (8.2%) zeros | Zeros |
ZIP_CODE has 982 (1.2%) zeros | Zeros |
ADDRESS
Categorical
| Distinct count | 67563 |
|---|---|
| Unique (%) | 79.9% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 131-05 40TH ROAD | 210 |
|---|---|
| 429 KENT AVENUE | 158 |
| 169 WEST 95TH STREET | 153 |
| Other values (67560) |
| Value | Count | Frequency (%) | |
| 131-05 40TH ROAD | 210 | 0.2% | |
| 429 KENT AVENUE | 158 | 0.2% | |
| 169 WEST 95TH STREET | 153 | 0.2% | |
| 131-03 40TH ROAD | 147 | 0.2% | |
| 265 STATE STREET | 127 | 0.2% | |
| 550 VANDERBILT AVENUE | 126 | 0.1% | |
| 50 WEST STREET | 115 | 0.1% | |
| 39TH AVENUE | 108 | 0.1% | |
| 30 PARK PLACE | 107 | 0.1% | |
| 1809 EMMONS AVENUE | 103 | 0.1% | |
| Other values (67553) | 83194 | 98.4% |
| Max length | 34 |
|---|---|
| Mean length | 19.26264371 |
| Min length | 5 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
APARTMENT_NUMBER
Categorical
| Distinct count | 3989 |
|---|---|
| Unique (%) | 4.7% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 4 | 298 |
|---|---|
| 3A | 295 |
| Other values (3986) |
| Value | Count | Frequency (%) | |
| 65496 | 77.5% | ||
| 4 | 298 | 0.4% | |
| 3A | 295 | 0.3% | |
| 2 | 275 | 0.3% | |
| 3B | 275 | 0.3% | |
| 2B | 272 | 0.3% | |
| 2A | 263 | 0.3% | |
| 3 | 263 | 0.3% | |
| 1 | 242 | 0.3% | |
| 4B | 228 | 0.3% | |
| Other values (3979) | 16641 | 19.7% |
| Max length | 11 |
|---|---|
| Mean length | 1.34465629 |
| Min length | 1 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
BLOCK
Numeric
| Distinct count | 11566 |
|---|---|
| Unique (%) | 13.7% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4237.218976 |
|---|---|
| Minimum | 1 |
| Maximum | 16322 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 276 |
| Q1 | 1322.75 |
| Median | 3311 |
| Q3 | 6281 |
| 95-th percentile | 11615.65 |
| Maximum | 16322 |
| Range | 16321 |
| Interquartile range | 4958.25 |
Descriptive statistics
| Standard deviation | 3568.263407 |
|---|---|
| Coef of variation | 0.8421239088 |
| Kurtosis | 0.5968940341 |
| Mean | 4237.218976 |
| MAD | 2910.575662 |
| Skewness | 1.049335039 |
| Sum | 358248390 |
| Variance | 12732503.74 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 5066 | 404 | 0.5% | |
| 16 | 255 | 0.3% | |
| 2135 | 211 | 0.2% | |
| 4978 | 187 | 0.2% | |
| 1171 | 181 | 0.2% | |
| 8489 | 170 | 0.2% | |
| 1226 | 168 | 0.2% | |
| 3944 | 152 | 0.2% | |
| 31 | 135 | 0.2% | |
| 1129 | 135 | 0.2% | |
| Other values (11556) | 82550 | 97.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 26 | < 0.1% | |
| 3 | 5 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 7 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 16322 | 1 | < 0.1% | |
| 16319 | 1 | < 0.1% | |
| 16317 | 3 | < 0.1% | |
| 16316 | 2 | < 0.1% | |
| 16315 | 2 | < 0.1% |
BOROUGH
Numeric
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.998758102 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| Median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 1.289790049 |
|---|---|
| Coef of variation | 0.4301080665 |
| Kurtosis | -1.029919869 |
| Mean | 2.998758102 |
| MAD | 1.032064902 |
| Skewness | -0.3250051722 |
| Sum | 253539 |
| Variance | 1.663558371 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 4 | 26736 | 31.6% | |
| 3 | 24047 | 28.4% | |
| 1 | 18306 | 21.7% | |
| 5 | 8410 | 9.9% | |
| 2 | 7049 | 8.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 18306 | 21.7% | |
| 2 | 7049 | 8.3% | |
| 3 | 24047 | 28.4% | |
| 4 | 26736 | 31.6% | |
| 5 | 8410 | 9.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5 | 8410 | 9.9% | |
| 4 | 26736 | 31.6% | |
| 3 | 24047 | 28.4% | |
| 2 | 7049 | 8.3% | |
| 1 | 18306 | 21.7% |
BUILDING_CLASS_AT_PRESENT
Categorical
| Distinct count | 167 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| D4 | |
|---|---|
| R4 | |
| A1 | 6753 |
| Other values (164) |
| Value | Count | Frequency (%) | |
| D4 | 12663 | 15.0% | |
| R4 | 12482 | 14.8% | |
| A1 | 6753 | 8.0% | |
| A5 | 5683 | 6.7% | |
| B2 | 4923 | 5.8% | |
| B1 | 4749 | 5.6% | |
| C0 | 4379 | 5.2% | |
| B3 | 3824 | 4.5% | |
| A2 | 2821 | 3.3% | |
| C6 | 2760 | 3.3% | |
| Other values (157) | 23511 | 27.8% |
| Max length | 2 |
|---|---|
| Mean length | 1.991271231 |
| Min length | 1 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
BUILDING_CLASS_AT_TIME_OF_SALE
Categorical
| Distinct count | 166 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| R4 | |
|---|---|
| D4 | |
| A1 | 6751 |
| Other values (163) |
| Value | Count | Frequency (%) | |
| R4 | 12989 | 15.4% | |
| D4 | 12666 | 15.0% | |
| A1 | 6751 | 8.0% | |
| A5 | 5671 | 6.7% | |
| B2 | 4918 | 5.8% | |
| B1 | 4747 | 5.6% | |
| C0 | 4384 | 5.2% | |
| B3 | 3821 | 4.5% | |
| A2 | 2867 | 3.4% | |
| C6 | 2760 | 3.3% | |
| Other values (156) | 22974 | 27.2% |
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
BUILDING_CLASS_CATEGORY
Categorical
| Distinct count | 47 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 01 ONE FAMILY DWELLINGS | |
|---|---|
| 02 TWO FAMILY DWELLINGS | |
| 13 CONDOS - ELEVATOR APARTMENTS | |
| Other values (44) |
| Value | Count | Frequency (%) | |
| 01 ONE FAMILY DWELLINGS | 18235 | 21.6% | |
| 02 TWO FAMILY DWELLINGS | 15828 | 18.7% | |
| 13 CONDOS - ELEVATOR APARTMENTS | 12989 | 15.4% | |
| 10 COOPS - ELEVATOR APARTMENTS | 12902 | 15.3% | |
| 03 THREE FAMILY DWELLINGS | 4384 | 5.2% | |
| 07 RENTALS - WALKUP APARTMENTS | 3466 | 4.1% | |
| 09 COOPS - WALKUP APARTMENTS | 2767 | 3.3% | |
| 04 TAX CLASS 1 CONDOS | 1656 | 2.0% | |
| 44 CONDO PARKING | 1441 | 1.7% | |
| 15 CONDOS - 2-10 UNIT RESIDENTIAL | 1281 | 1.5% | |
| Other values (37) | 9599 | 11.4% |
| Max length | 44 |
|---|---|
| Mean length | 43.00050859 |
| Min length | 43 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
COMMERCIAL_UNITS
Numeric
| Distinct count | 55 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.1935586886 |
|---|---|
| Minimum | 0 |
| Maximum | 2261 |
| Zeros (%) | 93.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 2261 |
| Range | 2261 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 8.713183368 |
|---|---|
| Coef of variation | 45.01571814 |
| Kurtosis | 53950.59279 |
| Mean | 0.1935586886 |
| MAD | 0.3636791662 |
| Skewness | 214.4011234 |
| Sum | 16365 |
| Variance | 75.91956441 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 0 | 79429 | 93.9% | |
| 1 | 3558 | 4.2% | |
| 2 | 817 | 1.0% | |
| 3 | 259 | 0.3% | |
| 4 | 137 | 0.2% | |
| 5 | 74 | 0.1% | |
| 6 | 70 | 0.1% | |
| 7 | 31 | < 0.1% | |
| 8 | 26 | < 0.1% | |
| 9 | 20 | < 0.1% | |
| Other values (45) | 127 | 0.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 79429 | 93.9% | |
| 1 | 3558 | 4.2% | |
| 2 | 817 | 1.0% | |
| 3 | 259 | 0.3% | |
| 4 | 137 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2261 | 1 | < 0.1% | |
| 436 | 2 | < 0.1% | |
| 422 | 2 | < 0.1% | |
| 318 | 1 | < 0.1% | |
| 254 | 4 | < 0.1% |
EASE-MENT
Constant
This variable is constant and should be ignored for analysis
| Constant value |
|---|
GROSS_SQUARE_FEET
Categorical
| Distinct count | 5691 |
|---|---|
| Unique (%) | 6.7% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| - | |
|---|---|
| 0 | |
| 2400 | 386 |
| Other values (5688) |
| Value | Count | Frequency (%) | |
| - | 27612 | 32.7% | |
| 0 | 11417 | 13.5% | |
| 2400 | 386 | 0.5% | |
| 1800 | 361 | 0.4% | |
| 2000 | 359 | 0.4% | |
| 1600 | 346 | 0.4% | |
| 1440 | 340 | 0.4% | |
| 3000 | 324 | 0.4% | |
| 1200 | 295 | 0.3% | |
| 1280 | 281 | 0.3% | |
| Other values (5681) | 42827 | 50.7% |
| Max length | 7 |
|---|---|
| Mean length | 3.595779912 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
LAND_SQUARE_FEET
Categorical
| Distinct count | 6062 |
|---|---|
| Unique (%) | 7.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| - | |
|---|---|
| 0 | |
| 2000 | 3919 |
| Other values (6059) |
| Value | Count | Frequency (%) | |
| - | 26252 | 31.0% | |
| 0 | 10326 | 12.2% | |
| 2000 | 3919 | 4.6% | |
| 2500 | 3470 | 4.1% | |
| 4000 | 3044 | 3.6% | |
| 1800 | 1192 | 1.4% | |
| 3000 | 1190 | 1.4% | |
| 5000 | 1009 | 1.2% | |
| 2200 | 512 | 0.6% | |
| 2400 | 486 | 0.6% | |
| Other values (6052) | 33148 | 39.2% |
| Max length | 7 |
|---|---|
| Mean length | 3.648672943 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
LOT
Numeric
| Distinct count | 2627 |
|---|---|
| Unique (%) | 3.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 376.2240148 |
|---|---|
| Minimum | 1 |
| Maximum | 9106 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 22 |
| Median | 50 |
| Q3 | 1001 |
| 95-th percentile | 1403 |
| Maximum | 9106 |
| Range | 9105 |
| Interquartile range | 979 |
Descriptive statistics
| Standard deviation | 658.136814 |
|---|---|
| Coef of variation | 1.749321649 |
| Kurtosis | 24.93765801 |
| Mean | 376.2240148 |
| MAD | 486.2095452 |
| Skewness | 3.500679349 |
| Sum | 31808988 |
| Variance | 433144.0659 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 1 | 4125 | 4.9% | |
| 20 | 983 | 1.2% | |
| 12 | 972 | 1.1% | |
| 40 | 935 | 1.1% | |
| 23 | 911 | 1.1% | |
| 10 | 895 | 1.1% | |
| 15 | 894 | 1.1% | |
| 29 | 891 | 1.1% | |
| 25 | 879 | 1.0% | |
| 19 | 874 | 1.0% | |
| Other values (2617) | 72189 | 85.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 4125 | 4.9% | |
| 2 | 742 | 0.9% | |
| 3 | 811 | 1.0% | |
| 4 | 685 | 0.8% | |
| 5 | 805 | 1.0% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 9106 | 1 | < 0.1% | |
| 9099 | 1 | < 0.1% | |
| 9085 | 1 | < 0.1% | |
| 9081 | 1 | < 0.1% | |
| 9080 | 1 | < 0.1% |
NEIGHBORHOOD
Categorical
| Distinct count | 254 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| FLUSHING-NORTH | 3068 |
|---|---|
| UPPER EAST SIDE (59-79) | 1736 |
| UPPER EAST SIDE (79-96) | 1590 |
| Other values (251) |
| Value | Count | Frequency (%) | |
| FLUSHING-NORTH | 3068 | 3.6% | |
| UPPER EAST SIDE (59-79) | 1736 | 2.1% | |
| UPPER EAST SIDE (79-96) | 1590 | 1.9% | |
| UPPER WEST SIDE (59-79) | 1439 | 1.7% | |
| BEDFORD STUYVESANT | 1436 | 1.7% | |
| MIDTOWN EAST | 1418 | 1.7% | |
| BOROUGH PARK | 1245 | 1.5% | |
| ASTORIA | 1216 | 1.4% | |
| BAYSIDE | 1150 | 1.4% | |
| FOREST HILLS | 1069 | 1.3% | |
| Other values (244) | 69181 | 81.8% |
| Max length | 25 |
|---|---|
| Mean length | 13.14498273 |
| Min length | 4 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
RESIDENTIAL_UNITS
Numeric
| Distinct count | 176 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.025263755 |
|---|---|
| Minimum | 0 |
| Maximum | 1844 |
| Zeros (%) | 29.3% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 1844 |
| Range | 1844 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 16.72103701 |
|---|---|
| Coef of variation | 8.256226859 |
| Kurtosis | 5299.9341 |
| Mean | 2.025263755 |
| MAD | 2.039002171 |
| Skewness | 60.70273283 |
| Sum | 171232 |
| Variance | 279.5930788 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 1 | 34722 | 41.1% | |
| 0 | 24783 | 29.3% | |
| 2 | 16049 | 19.0% | |
| 3 | 4608 | 5.5% | |
| 4 | 1346 | 1.6% | |
| 6 | 787 | 0.9% | |
| 8 | 332 | 0.4% | |
| 5 | 273 | 0.3% | |
| 10 | 145 | 0.2% | |
| 16 | 122 | 0.1% | |
| Other values (166) | 1381 | 1.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 24783 | 29.3% | |
| 1 | 34722 | 41.1% | |
| 2 | 16049 | 19.0% | |
| 3 | 4608 | 5.5% | |
| 4 | 1346 | 1.6% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1844 | 2 | < 0.1% | |
| 1641 | 1 | < 0.1% | |
| 948 | 1 | < 0.1% | |
| 894 | 1 | < 0.1% | |
| 889 | 1 | < 0.1% |
SALE_DATE
Categorical
| Distinct count | 364 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2017-06-29 00:00:00 | 544 |
|---|---|
| 2017-06-15 00:00:00 | 530 |
| 2016-12-22 00:00:00 | 527 |
| Other values (361) |
| Value | Count | Frequency (%) | |
| 2017-06-29 00:00:00 | 544 | 0.6% | |
| 2017-06-15 00:00:00 | 530 | 0.6% | |
| 2016-12-22 00:00:00 | 527 | 0.6% | |
| 2017-05-25 00:00:00 | 511 | 0.6% | |
| 2016-10-06 00:00:00 | 508 | 0.6% | |
| 2016-10-28 00:00:00 | 493 | 0.6% | |
| 2017-03-30 00:00:00 | 493 | 0.6% | |
| 2017-06-30 00:00:00 | 493 | 0.6% | |
| 2016-09-22 00:00:00 | 489 | 0.6% | |
| 2016-09-29 00:00:00 | 474 | 0.6% | |
| Other values (354) | 79486 | 94.0% |
| Max length | 19 |
|---|---|
| Mean length | 19 |
| Min length | 19 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
SALE_PRICE
Categorical
| Distinct count | 10008 |
|---|---|
| Unique (%) | 11.8% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| - | |
|---|---|
| 0 | 10228 |
| 10 | 766 |
| Other values (10005) |
| Value | Count | Frequency (%) | |
| - | 14561 | 17.2% | |
| 0 | 10228 | 12.1% | |
| 10 | 766 | 0.9% | |
| 450000 | 427 | 0.5% | |
| 550000 | 416 | 0.5% | |
| 650000 | 414 | 0.5% | |
| 600000 | 409 | 0.5% | |
| 700000 | 382 | 0.5% | |
| 400000 | 378 | 0.4% | |
| 750000 | 377 | 0.4% | |
| Other values (9998) | 56190 | 66.5% |
| Max length | 10 |
|---|---|
| Mean length | 5.176030184 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
TAX_CLASS_AT_PRESENT
Categorical
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 1 | |
|---|---|
| 2 | |
| 4 | 6140 |
| Other values (8) |
| Value | Count | Frequency (%) | |
| 1 | 38633 | 45.7% | |
| 2 | 30919 | 36.6% | |
| 4 | 6140 | 7.3% | |
| 2A | 2521 | 3.0% | |
| 2C | 1915 | 2.3% | |
| 1A | 1444 | 1.7% | |
| 1B | 1234 | 1.5% | |
| 2B | 814 | 1.0% | |
| 738 | 0.9% | ||
| 1C | 186 | 0.2% |
| Max length | 2 |
|---|---|
| Mean length | 1.095969154 |
| Min length | 1 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
TAX_CLASS_AT_TIME_OF_SALE
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 1 | |
|---|---|
| 2 | |
| 4 | 6285 |
| Value | Count | Frequency (%) | |
| 1 | 41533 | 49.1% | |
| 2 | 36726 | 43.4% | |
| 4 | 6285 | 7.4% | |
| 3 | 4 | < 0.1% |
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
TOTAL_UNITS
Numeric
| Distinct count | 192 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.249183896 |
|---|---|
| Minimum | 0 |
| Maximum | 2261 |
| Zeros (%) | 23.4% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 2261 |
| Range | 2261 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 18.97258443 |
|---|---|
| Coef of variation | 8.435319348 |
| Kurtosis | 5719.583676 |
| Mean | 2.249183896 |
| MAD | 2.278648393 |
| Skewness | 63.44833684 |
| Sum | 190164 |
| Variance | 359.95896 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 1 | 38356 | 45.4% | |
| 0 | 19762 | 23.4% | |
| 2 | 15914 | 18.8% | |
| 3 | 5412 | 6.4% | |
| 4 | 1498 | 1.8% | |
| 6 | 870 | 1.0% | |
| 5 | 423 | 0.5% | |
| 8 | 374 | 0.4% | |
| 10 | 198 | 0.2% | |
| 7 | 197 | 0.2% | |
| Other values (182) | 1544 | 1.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 19762 | 23.4% | |
| 1 | 38356 | 45.4% | |
| 2 | 15914 | 18.8% | |
| 3 | 5412 | 6.4% | |
| 4 | 1498 | 1.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2261 | 1 | < 0.1% | |
| 1866 | 2 | < 0.1% | |
| 1653 | 1 | < 0.1% | |
| 955 | 1 | < 0.1% | |
| 902 | 1 | < 0.1% |
Unnamed_0
Numeric
| Distinct count | 26736 |
|---|---|
| Unique (%) | 31.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 10344.35988 |
|---|---|
| Minimum | 4 |
| Maximum | 26739 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 849 |
| Q1 | 4231 |
| Median | 8942 |
| Q3 | 15987.25 |
| 95-th percentile | 23281 |
| Maximum | 26739 |
| Range | 26735 |
| Interquartile range | 11756.25 |
Descriptive statistics
| Standard deviation | 7151.779436 |
|---|---|
| Coef of variation | 0.6913699369 |
| Kurtosis | -0.9282200569 |
| Mean | 10344.35988 |
| MAD | 6151.563062 |
| Skewness | 0.4407807646 |
| Sum | 874594939 |
| Variance | 51147949.11 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 2047 | 5 | < 0.1% | |
| 5475 | 5 | < 0.1% | |
| 1569 | 5 | < 0.1% | |
| 3616 | 5 | < 0.1% | |
| 5603 | 5 | < 0.1% | |
| 1505 | 5 | < 0.1% | |
| 3552 | 5 | < 0.1% | |
| 5539 | 5 | < 0.1% | |
| 1441 | 5 | < 0.1% | |
| 3488 | 5 | < 0.1% | |
| Other values (26726) | 84498 | 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 4 | 5 | < 0.1% | |
| 5 | 5 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 7 | 5 | < 0.1% | |
| 8 | 5 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 26739 | 1 | < 0.1% | |
| 26738 | 1 | < 0.1% | |
| 26737 | 1 | < 0.1% | |
| 26736 | 1 | < 0.1% | |
| 26735 | 1 | < 0.1% |
YEAR_BUILT
Numeric
| Distinct count | 158 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1789.322976 |
|---|---|
| Minimum | 0 |
| Maximum | 2017 |
| Zeros (%) | 8.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1920 |
| Median | 1940 |
| Q3 | 1965 |
| 95-th percentile | 2013 |
| Maximum | 2017 |
| Range | 2017 |
| Interquartile range | 45 |
Descriptive statistics
| Standard deviation | 537.3449934 |
|---|---|
| Coef of variation | 0.3003063173 |
| Kurtosis | 7.146380103 |
| Mean | 1789.322976 |
| MAD | 295.0364004 |
| Skewness | -3.016062029 |
| Sum | 151283679 |
| Variance | 288739.642 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 0 | 6970 | 8.2% | |
| 1920 | 6045 | 7.1% | |
| 1930 | 5043 | 6.0% | |
| 1925 | 4312 | 5.1% | |
| 1910 | 3585 | 4.2% | |
| 1950 | 3156 | 3.7% | |
| 1960 | 2654 | 3.1% | |
| 1940 | 2456 | 2.9% | |
| 1931 | 2246 | 2.7% | |
| 1955 | 1961 | 2.3% | |
| Other values (148) | 46120 | 54.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 6970 | 8.2% | |
| 1111 | 1 | < 0.1% | |
| 1680 | 1 | < 0.1% | |
| 1800 | 37 | < 0.1% | |
| 1826 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2017 | 6 | < 0.1% | |
| 2016 | 794 | 0.9% | |
| 2015 | 1470 | 1.7% | |
| 2014 | 1232 | 1.5% | |
| 2013 | 743 | 0.9% |
ZIP_CODE
Numeric
| Distinct count | 186 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 10731.99161 |
|---|---|
| Minimum | 0 |
| Maximum | 11694 |
| Zeros (%) | 1.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10011 |
| Q1 | 10305 |
| Median | 11209 |
| Q3 | 11357 |
| 95-th percentile | 11427 |
| Maximum | 11694 |
| Range | 11694 |
| Interquartile range | 1052 |
Descriptive statistics
| Standard deviation | 1290.879147 |
|---|---|
| Coef of variation | 0.1202832795 |
| Kurtosis | 52.53929708 |
| Mean | 10731.99161 |
| MAD | 676.1041496 |
| Skewness | -6.656320824 |
| Sum | 907368427 |
| Variance | 1666368.973 |
| Memory size | 660.7 KiB |
| Value | Count | Frequency (%) | |
| 10314 | 1687 | 2.0% | |
| 11354 | 1384 | 1.6% | |
| 11201 | 1324 | 1.6% | |
| 11235 | 1312 | 1.6% | |
| 11234 | 1165 | 1.4% | |
| 11375 | 1144 | 1.4% | |
| 10312 | 1088 | 1.3% | |
| 10306 | 1061 | 1.3% | |
| 10023 | 1053 | 1.2% | |
| 10011 | 1048 | 1.2% | |
| Other values (176) | 72282 | 85.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 982 | 1.2% | |
| 10001 | 204 | 0.2% | |
| 10002 | 328 | 0.4% | |
| 10003 | 812 | 1.0% | |
| 10004 | 95 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 11694 | 273 | 0.3% | |
| 11693 | 142 | 0.2% | |
| 11692 | 157 | 0.2% | |
| 11691 | 435 | 0.5% | |
| 11436 | 312 | 0.4% |
First rows
| ADDRESS | APARTMENT_NUMBER | BLOCK | BOROUGH | BUILDING_CLASS_AT_PRESENT | BUILDING_CLASS_AT_TIME_OF_SALE | BUILDING_CLASS_CATEGORY | COMMERCIAL_UNITS | EASE-MENT | GROSS_SQUARE_FEET | LAND_SQUARE_FEET | LOT | NEIGHBORHOOD | RESIDENTIAL_UNITS | SALE_DATE | SALE_PRICE | TAX_CLASS_AT_PRESENT | TAX_CLASS_AT_TIME_OF_SALE | TOTAL_UNITS | Unnamed_0 | YEAR_BUILT | ZIP_CODE | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 153 AVENUE B | 392 | 1 | C2 | C2 | 07 RENTALS - WALKUP APARTMENTS | 0 | 6440 | 1633 | 6 | ALPHABET CITY | 5 | 2017-07-19 00:00:00 | 6625000 | 2A | 2 | 5 | 4 | 1900 | 10009 | ||
| 1 | 234 EAST 4TH STREET | 399 | 1 | C7 | C7 | 07 RENTALS - WALKUP APARTMENTS | 3 | 18690 | 4616 | 26 | ALPHABET CITY | 28 | 2016-12-14 00:00:00 | - | 2 | 2 | 31 | 5 | 1900 | 10009 | ||
| 2 | 197 EAST 3RD STREET | 399 | 1 | C7 | C7 | 07 RENTALS - WALKUP APARTMENTS | 1 | 7803 | 2212 | 39 | ALPHABET CITY | 16 | 2016-12-09 00:00:00 | - | 2 | 2 | 17 | 6 | 1900 | 10009 | ||
| 3 | 154 EAST 7TH STREET | 402 | 1 | C4 | C4 | 07 RENTALS - WALKUP APARTMENTS | 0 | 6794 | 2272 | 21 | ALPHABET CITY | 10 | 2016-09-23 00:00:00 | 3936272 | 2B | 2 | 10 | 7 | 1913 | 10009 | ||
| 4 | 301 EAST 10TH STREET | 404 | 1 | C2 | C2 | 07 RENTALS - WALKUP APARTMENTS | 0 | 4615 | 2369 | 55 | ALPHABET CITY | 6 | 2016-11-17 00:00:00 | 8000000 | 2A | 2 | 6 | 8 | 1900 | 10009 | ||
| 5 | 516 EAST 12TH STREET | 405 | 1 | C4 | C4 | 07 RENTALS - WALKUP APARTMENTS | 0 | 9730 | 2581 | 16 | ALPHABET CITY | 20 | 2017-07-20 00:00:00 | - | 2 | 2 | 20 | 9 | 1900 | 10009 | ||
| 6 | 210 AVENUE B | 406 | 1 | C4 | C4 | 07 RENTALS - WALKUP APARTMENTS | 0 | 4226 | 1750 | 32 | ALPHABET CITY | 8 | 2016-09-23 00:00:00 | 3192840 | 2B | 2 | 8 | 10 | 1920 | 10009 | ||
| 7 | 520 EAST 14TH STREET | 407 | 1 | C7 | C7 | 07 RENTALS - WALKUP APARTMENTS | 2 | 21007 | 5163 | 18 | ALPHABET CITY | 44 | 2017-07-20 00:00:00 | - | 2 | 2 | 46 | 11 | 1900 | 10009 | ||
| 8 | 141 AVENUE D | 379 | 1 | D5 | D5 | 08 RENTALS - ELEVATOR APARTMENTS | 0 | 9198 | 1534 | 34 | ALPHABET CITY | 15 | 2017-06-20 00:00:00 | - | 2 | 2 | 15 | 12 | 1920 | 10009 | ||
| 9 | 629 EAST 5TH STREET | 387 | 1 | D9 | D9 | 08 RENTALS - ELEVATOR APARTMENTS | 0 | 18523 | 4489 | 153 | ALPHABET CITY | 24 | 2016-11-07 00:00:00 | 16232000 | 2 | 2 | 24 | 13 | 1920 | 10009 |
Last rows
| ADDRESS | APARTMENT_NUMBER | BLOCK | BOROUGH | BUILDING_CLASS_AT_PRESENT | BUILDING_CLASS_AT_TIME_OF_SALE | BUILDING_CLASS_CATEGORY | COMMERCIAL_UNITS | EASE-MENT | GROSS_SQUARE_FEET | LAND_SQUARE_FEET | LOT | NEIGHBORHOOD | RESIDENTIAL_UNITS | SALE_DATE | SALE_PRICE | TAX_CLASS_AT_PRESENT | TAX_CLASS_AT_TIME_OF_SALE | TOTAL_UNITS | Unnamed_0 | YEAR_BUILT | ZIP_CODE | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 84538 | 178 DARNELL LANE | 7316 | 5 | B2 | B2 | 02 TWO FAMILY DWELLINGS | 0 | 1300 | 3215 | 61 | WOODROW | 2 | 2017-06-30 00:00:00 | - | 1 | 1 | 2 | 8404 | 1995 | 10309 | ||
| 84539 | 137 DARNELL LANE | 7316 | 5 | B2 | B2 | 02 TWO FAMILY DWELLINGS | 0 | 1300 | 3016 | 85 | WOODROW | 2 | 2016-12-30 00:00:00 | - | 1 | 1 | 2 | 8405 | 1995 | 10309 | ||
| 84540 | 125 DARNELL LANE | 7316 | 5 | B2 | B2 | 02 TWO FAMILY DWELLINGS | 0 | 1300 | 3325 | 93 | WOODROW | 2 | 2016-10-31 00:00:00 | 509000 | 1 | 1 | 2 | 8406 | 1995 | 10309 | ||
| 84541 | 112 ROBIN COURT | 7317 | 5 | B2 | B2 | 02 TWO FAMILY DWELLINGS | 0 | 2160 | 11088 | 126 | WOODROW | 2 | 2016-12-07 00:00:00 | 648000 | 1 | 1 | 2 | 8407 | 1994 | 10309 | ||
| 84542 | 41 SONIA COURT | 7339 | 5 | B9 | B9 | 02 TWO FAMILY DWELLINGS | 0 | 1800 | 3020 | 41 | WOODROW | 2 | 2016-12-01 00:00:00 | - | 1 | 1 | 2 | 8408 | 1997 | 10309 | ||
| 84543 | 37 QUAIL LANE | 7349 | 5 | B9 | B9 | 02 TWO FAMILY DWELLINGS | 0 | 2575 | 2400 | 34 | WOODROW | 2 | 2016-11-28 00:00:00 | 450000 | 1 | 1 | 2 | 8409 | 1998 | 10309 | ||
| 84544 | 32 PHEASANT LANE | 7349 | 5 | B9 | B9 | 02 TWO FAMILY DWELLINGS | 0 | 2377 | 2498 | 78 | WOODROW | 2 | 2017-04-21 00:00:00 | 550000 | 1 | 1 | 2 | 8410 | 1998 | 10309 | ||
| 84545 | 49 PITNEY AVENUE | 7351 | 5 | B2 | B2 | 02 TWO FAMILY DWELLINGS | 0 | 1496 | 4000 | 60 | WOODROW | 2 | 2017-07-05 00:00:00 | 460000 | 1 | 1 | 2 | 8411 | 1925 | 10309 | ||
| 84546 | 2730 ARTHUR KILL ROAD | 7100 | 5 | K6 | K6 | 22 STORE BUILDINGS | 7 | 64117 | 208033 | 28 | WOODROW | 0 | 2016-12-21 00:00:00 | 11693337 | 4 | 4 | 7 | 8412 | 2001 | 10309 | ||
| 84547 | 155 CLAY PIT ROAD | 7105 | 5 | P9 | P9 | 35 INDOOR PUBLIC AND CULTURAL FACILITIES | 1 | 2400 | 10796 | 679 | WOODROW | 0 | 2016-10-27 00:00:00 | 69300 | 4 | 4 | 1 | 8413 | 2006 | 10309 |